This preface contains:
The following are changes in Oracle Big Data Appliance release 4 (4.2):
Software Upgrades
Cloudera's Distribution including Apache Hadoop 5.4.0
Cloudera Manager 5.4.0
Perfect Balance 2.4.0
Oracle Big Data SQL 1.1
No SQL Database 3.2.5
Oracle Linux 6.6 and 5.11
JDK 8u45
Hardware Upgrades
Oracle Big Data Appliance is now shipped with 8 TB disk drives
Elastic Configuration
Oracle Big Data Appliance now provides the flexibility of adding one or more servers on a starter rack using Big Data Appliance X5-2 High Capacity Nodes plus InfiniBand Infrastructure. You can add up to 12 additional servers on a starter rack.
See Chapter 8, "Expanding an Oracle Big Data Appliance Starter Rack".
Automatic Installation Support
Spark-on-YARN is deployed automatically
Oracle Spatial and Graph is installed and configured automatically
Oracle Big Data SQL 1.1
Copy to BDA
This utility enables you to copy relatively static tables from an Oracle database into Hadoop, with the purpose of improving query times.
Oracle NoSQL Database Support
Oracle databases on Oracle Exadata Database Machine can use Oracle Big Data SQL to connect to clusters running Oracle NoSQL Database.
Parquet Support
CDH 5.2 and later versions include Hive 0.13, which supports the Apache Parquet file format. This file format is used by Cloudera Impala and other Hadoop software.
Oracle Big Data Appliance X5-2
Oracle Big Data Appliance 4.2 software supports Oracle Big Data Appliance X5-2 and earlier version server hardware.
See "Server Components".
Oracle Big Data Appliance Configuration Generation Utility
This utility generates two new configuration files:
network.json
: Supersedes BdaDeploy.json
. For software upgrades, Mammoth converts the existingBdaDeploy.json
to network.json
. New installations must have network.json
.
networkexpansion.json
: Supersedes BdaExpansion.json
.
CDH Deployment
Mammoth uses parcels instead of RPMs to deploy CDH.
Apache Sentry
Installation of Apache Sentry does not require sentry-provider.ini
as a prerequisite.
Microsoft Active Directory Server in Mammoth
Support for directly using Microsoft Active Directory named as Active Directory Kerberos in Mammoth.
Oracle Linux Support
Oracle Linux 5 support for Oracle Big Data Appliance X5-2 servers.
Cloudera Navigator Trustee Server
Cloudera Navigator Trustee Server installer package and documentation are now shipped in Mammoth. It must be manually installed on a separate server.
The following features are deprecated in this release, and may be desupported in a future release:
Mammoth Reconfiguration Utility
The bdacli
utility supersedes mammoth-reconfig
. The mammoth-reconfig
utility is only needed to change the disk encryption password.
See "bdacli".
MapReduce 1 (MRv1)
YARN (MRv2) supersedes MRv1. Users who want to continue to use MRv1 on Oracle Big Data Appliance versions 3.x and 4.x should contact Oracle Support before using Mammoth to patch or upgrade the software.
Disk Encryption
A new encryption system that is more flexible and robust will replace the current system in an upcoming release.
The following are changes in Oracle Big Data Appliance release 4 (4.1):
Software Upgrades
Cloudera's Distribution including Apache Hadoop 5.3.0
Cloudera Manager 5.3.0
Perfect Balance 2.3.0
Oracle Big Data SQL 1.1
Oracle Big Data Connectors 4.1
Oracle Linux 6.5
Oracle Big Data SQL 1.1
Copy to BDA
This utility enables you to copy relatively static tables from an Oracle database into Hadoop, with the purpose of improving query times.
Oracle NoSQL Database Support
Oracle databases on Oracle Exadata Database Machine can use Oracle Big Data SQL to connect to clusters running Oracle NoSQL Database.
Parquet Support
CDH 5.2 and later versions include Hive 0.13, which supports the Apache Parquet file format. This file format is used by Cloudera Impala and other Hadoop software.
Oracle NoSQL Database
The bdacli admin_cluster
command supports Oracle NoSQL Database nodes that require repair or replacement.
Oracle Big Data Appliance X5-2
Oracle Big Data Appliance 4.1 software supports the Oracle Big Data Appliance X5-2 server hardware.
See "Server Components".
Oracle Big Data Appliance Configuration Generation Utility
This utility generates two new configuration files:
network.json
: Supersedes BdaDeploy.json
. For software upgrades, Mammoth converts the existingBdaDeploy.json
to network.json
. New installations must have network.json
.
networkexpansion.json
: Supersedes BdaExpansion.json
.
CDH Deployment
Mammoth uses parcels instead of RPMs to deploy CDH.
Apache Sentry
Installation of Apache Sentry does not require sentry-provider.ini
as a prerequisite.
The following features are deprecated in this release, and may be desupported in a future release:
Mammoth Reconfiguration Utility
The bdacli
utility supersedes mammoth-reconfig
. The mammoth-reconfig
utility is only needed to change the disk encryption password.
See "bdacli".
MapReduce 1 (MRv1)
YARN (MRv2) supersedes MRv1. Users who want to continue to use MRv1 on Oracle Big Data Appliance versions 3.x and 4.x should contact Oracle Support before using Mammoth to patch or upgrade the software.
Disk Encryption
A new encryption system that is more flexible and robust will replace the current system in an upcoming release.
The following are changes in Oracle Big Data Appliance release 4 (4.0):
Oracle Big Data SQL 1.0.0
Oracle Big Data SQL supports queries against vast amounts of big data stored in multiple data sources, including HDFS and Hive. You can view and analyze data from various data stores together, as if it were all stored in an Oracle database. Support for Oracle Big Data SQL includes the following new features in Oracle Database:
DBMS_HADOOP
PL/SQL package
Hive static data dictionary views
Access drivers for Hadoop and Hive
Oracle Big Data SQL is an installation option, which you can specify using the Oracle Big Data Appliance Configuration Generation Utility.
You can monitor and manage Oracle Big Data SQL using the bdacli
command and Cloudera Manager.
See "bdacli" and Oracle Big Data Appliance Software User's Guide.
Service Migration
The bdacli
utility can migrate services from a failing critical node to a healthy noncritical node. It can also remove failing critical and noncritical nodes from a cluster, and restore them to the cluster after repairs. See "bdacli" and Oracle Big Data Appliance Software User's Guide.
Software Upgrades
Cloudera's Distribution including Apache Hadoop 5.1.0
Cloudera Manager 5.1.1
Perfect Balance 2.2.0
Oracle Data Integrator Agent 12.1.3.0 (for Oracle Big Data Connectors)
Oracle NoSQL Database Zone Support
The Oracle Big Data Appliance Configuration Generation Utility and the mammoth -e command support multiple zones on Oracle NoSQL Database clusters. You can add nodes to an existing zone, or create a new primary or secondary zones.
See "Oracle NoSQL Configuration" and "Mammoth Software Installation and Configuration Utility".
Multiple Rack Clusters
You can now install a cluster on multiple racks using one cluster_name-config.json file.